Semi-supervised classification and clustering analysis for variable stars
نویسندگان
چکیده
The immense amount of time series data produced by astronomical surveys has called for the use machine learning algorithms to discover and classify several million celestial sources. In case variable stars, supervised approaches have become commonplace. However, this needs a considerable collection expert-labeled light curves achieve adequate performance, which is costly construct. To solve problem, we introduce two approaches. First, semi-supervised hierarchical method, requires substantially less trained than methods. Second, clustering analysis procedure that finds groups may correspond classes or sub-classes stars. Both methods are primarily supported dimensionality reduction visualization avoid curse dimensionality. We tested our with catalogs collected from OGLE, CSS, Gaia surveys. method reaches performance around 90\% all three selected stars using only $5\%$ in training. This suitable classifying main when there small training data. Our confirms most clusters found purity over respect 80\% sub-classes, suggesting type can be used large-scale variability as an initial step identify present and/or build sets, among many other possible applications.
منابع مشابه
Semi-supervised Variable Weighting for Clustering
Semi-supervised learning, which uses a small amount of labeled data in conjunction with a large amount of unlabeled data for training, has recently attracted huge research attention due to the considerable improvement in learning accuracy. In this work, we focus on semisupervised variable weighting for clustering, which is a critical step in clustering as it is known that interesting clustering...
متن کاملSemi-supervised Spectral Clustering for Classification
We propose a Classification Via Clustering (CVC) algorithm which enables existing clustering methods to be efficiently employed in classification problems. In CVC, training and test data are coclustered and class-cluster distributions are used to find the label of the test data. To determine an efficient number of clusters, a Semi-supervised Hierarchical Clustering (SHC) algorithm is proposed. ...
متن کاملUsing clustering analysis to improve semi-supervised classification
Semi-supervised classification has become an active topic recently and a number of algorithms, such as Self-training, have been proposed to improve the performance of supervised classification using unlabeled data. In this paper, we propose a semi-supervised learning framework which combines clustering and classification. Our motivation is that clustering analysis is a powerful knowledge-discov...
متن کاملClustering-Based Stratified Seed Sampling for Semi-Supervised Relation Classification
Seed sampling is critical in semi-supervised learning. This paper proposes a clusteringbased stratified seed sampling approach to semi-supervised learning. First, various clustering algorithms are explored to partition the unlabeled instances into different strata with each stratum represented by a center. Then, diversity-motivated intra-stratum sampling is adopted to choose the center and addi...
متن کاملA Semi-Supervised Clustering Method For P2P Traffic Classification
In the last years, the use of P2P applications has increased significantly and currently they represent a significant portion of the Internet traffic. In consequence of this growth, P2P traffic identification and classification are becoming increasingly important for network administrators and designers. However, this classification was not simple. Nowadays, P2P applications explicitly tried to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Monthly Notices of the Royal Astronomical Society
سال: 2022
ISSN: ['0035-8711', '1365-8711', '1365-2966']
DOI: https://doi.org/10.1093/mnras/stac2715